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Abstract. Wg show that the values of entropies of multidimensional shifts of 
finite type (SFTs) are characterized by a certain computation-theoretic prop- 
erty: a real number /i > is the entropy of such an SET if and only if it is 
right recursively enumerable, i.e. there is a computable sequence of rational 
numbers converging to h from above. The same characterization holds for the 
entropies of sofic shifts. On the other hand, the entropy of an irreducible SET 
is computable. 



1. Introduction 

A shift of finite type (SFT) is an ensemble of colorings of Z (a one-dimensional 
SFT) or Z'' for d > 1 (a multidimensional SFT) defined by local rules. SFTs are one 
of the fundamental objects of study in symbolic dynamics, and their most significant 
invariant is their (topological) entropy, which measures the asymptotic growth of 
the number of legal colorings of finite regions (see section[2]for definitions). Besides 
having been been studied extensively from a dynamical perspective as topological 
analogs of Markov chains [2TJ [281 [21] , SFTs appear naturally in a wide range of 
other disciplines. In information theory. SFTs were used by Shannon as models 
for discrete communication channels [24j . for which entropy describes the capacity; 
similarly, SFTs model "two-dimensional" channels SFTs have been used to 
study the dynamics of geodesic flows and have played an important role in the 
classification of the dynamics of Anosov and Axiom A diffeomorphisms [H [3] , where 
entropy is again a fundamental invariant. In mathematical physics SFTs are often 
called hard-core models, and are used to model a wide variety of physical systems; 
this is the thermodynamic formalism [23j . In this setup it is of central importance 
to understand the equilibrium states of the system, which in are the invariant 
measures of maximal entropy. 

It is well known that in the one-dimensional case the entropy of an SFT may 
be effectively calculated, since it is the logarithm of the spectral radius of a certain 
positive integer matrix which is derived from the combinatorial description of the 
system. D. Lind [16] has given an algebraic characterization of the numbers which 
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arise as entropies of one-dimensional SFTs. A Perron number is a real algebraic 
integer greater than 1 and greater than the modulus of its algebraic conjugates. The 
entropies of one-dimensional SFTs are precisely the non-negative rational multiples 
of logarithms of Perron numbers. 

In higher dimensions the problem becomes much more difficult. The dynamics 
of multidimensional SFTs is vastly more complicated than their one-dimensional 
counterparts. For instance, irreducible multidimensional SFTs may have more than 
one measure of maximal entropy [3] , and zero entropy can coexist with rather com- 
plex dynamics [20J . In general it is undecidablc whether a given set of rules define a 
nonempty SFT [2l [22] . Regarding the entropy, even when the rules defining an SFT 
enjoy good symmetry properties, calculating the entropy is usually beyond current 
technology. As a result numerical methods have been developed to approximate 
the entropy (e.g. [10]), but these usually apply to a restricted class systems. 

One should note that for certain Z'^-actions which arise as automorphisms of 
compact groups (a class which includes some SFTs), explicit expressions for the 
entropy have been obtained by D. Lind, K. Schmidt and T. Ward jT8]. We note 
however that while these expressions are explicit they do not provide much infor- 
mation on the properties of the entropies, e.g. whether they are algebraic, well 
approximable, etc. 

In this paper we characterize those real numbers which can occur as entropies 
of multidimensional SFTs in terms of their computation-theoretic properties. It is 
natural to say that a real number h is computable if it can be calculated to any 
desired accuracy. More precisely, h is computable if there is an algorithm which, 
given input n G N, produces a rational number r{n) with \h — r(ri)\ < 1/n. For 
example, every algebraic number is computable (since there are numerical methods 
for computing the roots of an integer polynomial), and so are e,7r, since they can 
be written as power series with computable coefficients and rate of convergence. 

A weaker notion is the following. A real number h is right recursively enumerable 
(sometimes called upper semi recursive) if there exists a Turing machine which, 
given n, computes a rational number r{n) > h such that r(n) — > h (equivalently, 
the right Dedekind cut {(/gQ : q > h} is a. recursive set of rationals). 

The class of right recursively enumerable numbers is countable since algorithms 
may be put in one-to-one correspondence with finite 0, 1-valued sequences, and 
hence there are only countably many of them. If h is computable then there is 
an algorithm computing r(n) with \h — r{n)\ < i, so the computable sequence 
r(n) -I- i converges to h from above. This shows that the class of right recursively 
enumerable numbers contains the computable numbers, and it can be shown to be 
strictly larger. For more information, see [15j . 
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Theorem 1.1. For d>2 the class of entropies of d- dimensional SFTs is the class 
of non-negative right recursively enumerable numbers. 

The property of right recursive enumerabihty is a necessary condition for a num- 
ber to be the entropy of an SFT because the naive approximation algorithm, which 
counts locally admissible patterns on cubes, converges from above to the entropy. 
This follows from the work of Friedland [S] ; we provide a different proof below. The 
main novelty here is the sufficiency of the condition. 

A sofic system is a factor of an SFT, i.e. an ensemble of colorings of Z'' obtained 
from a fixed SFT X by applying a local transformation to each coloring in X (for a 
definition, see section [2]). In the one dimensional case. Coven and Paul |S] showed 
that every sofic system Y can be extended to an SFT X with of the same entropy 
as Y . In particular, this implies that the class of entropies of sofic shifts is the same 
as that of SFTs. Whether the covering theorem is true in the multidimensional 
case is still open and seems quite hard (see [7] for a partial result). However some 
circumstantial evidence in favor of the covering theorem is provided by the following: 

Theorem 1.2. For d > 2, the class of entropies of d- dimensional sofic shifts is the 
same as that of d- dimensional SFTs. 

This is a consequence of the fact that the entropy of sofic shifts is right recursively 
enumerable (corollary 13.31 below) , and the fact that an SFT is in particular a sofic 
system. 

It is worth emphasizing that since there arc non-computable numbers which are 
right recursively enumerable, it follows from theorem 1 1 . 1 1 that there arc SFTs whose 
entropy cannot be computed effectively (a similar situation is known for cellular 
automata |T3] and general subshifts [H]). However, if one assumes strong enough 
mixing properties of the system the situation improves. Recall that an SFT is 
irreducible if any two admissible patterns far enough apart may be extended to the 
whole lattice (see section [2]) • 

Theorem 1.3. The entropy of an irreducible SFT is computable. 
We do not know if this condition is also sufficient. 

The rest of this paper is organized as follows. In the next section we introduce 
some notation and background. In section [3] we prove that the entropy of any 
SFT or sofic shift is right recursively enumerable, and that of an irreducible SFT is 
computable. In section[4]we outline the construction which constitutes the proof of 
the other direction of theorem ll.il Sections [B]-[5] give the details of the construction. 
In section [9] we discuss discuss some open problems. 
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2. Preliminaries 

In this section we provide some background from symbolic dynamics and define 
SFTs and entropy. See [17l [14] for more information on these subjects. 

2.1. Subshifts and SFTs. For a finite alphabet S let E^'' be the space of E- 
colorings of Z'' (this is called the full shift on E). For a subset F C Z'^ we refer to a 
function a E E^ as a coloring of F or an F-pattern. We say that patterns a E E^ 
and b E E^+" are congruent if a{v) ~ b{v + u) for every v E F. We say that a 
pattern a E E^ appears at w in a pattern b E E^ if b\F+u and a are congruent. 

li E C F and a E E^ then a induces a coloring of E by restriction, namely a\E- 
For a finite set F C Z'' and pattern a E E^ the cylinder set defined by a is 

[a] ^{xE E^' : x\f = a} 

We endow E^'' with the product topology, which is generated by the cylinder sets 
and makes E^ into a compact metrizable space. 

For u eU^ let cr" : E^ ^ E^ be the homeomorphisms 

(cr"(a;))(w) =.t(w + u) v e Z'^ 

This gives an action of on E^ called the shift action. A subset X C E^ is 
invariant under the shift action if (t"(X) ^ X for every u Ell^ . A closed invariant 
set X C yF" is called a Z'^-subshift. 

A c?-dimensional subshift of finite type (SFT) is defined by a finite alphabet E, 
a finite set F C Z'', and a collection L C E^ of E-colorings of F, called the syntax. 
A E-coloring x E E^ of Z*^ is admissible for F if the pattern induced by x on every 
translate of F is congruent to a pattern in L. The SFT defined by L is the set 
X C E^ of all admissible x. From the definition it is clear that an SFT is closed 
and shift-invariant. 

Given an SFT X defined by a syntax L C E^, we say that a finite pattern is 
globally admissible for X if it appears in A". In contrast we say that a pattern 
a E E^ is locally admissible if a\F+u is congruent to a pattern in L whenever 
F + 7i C F. A globally admissible pattern is locally admissible, but the latter is 
not true in general. 

An SFT A C E^ is irreducible if there is a constant r > 0, called a gap, such 
that for every A,BC_li'^ satisfying ||u — v\\^ > r ior u E A , v E B, and for every 
pair of globally admissible a E E"^ and b E E^, there is a point x E X with x\e = a 
and x\b ~ b (in other words, a U 6 is globally admissible). 
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2.2. Topological entropy of subshifts. For a subshift X C and F C 'Z'^ we 
say that a pattern a E appears 'm X ii a = x\p for some .t e X. For a set F let 
Nx{F) denote the number of distinct S-colorings of F which appear in X. Let 

denote the discrete d-dimensional cube of side n. The (topological) entropy h{X) 
of X is defined by 

hiX) = hm ^logNx{F„). 

By convention the logarithm is to base 2. The limit above exists, and is in fact 
equal to inf„gN log Nx{Fn). 



2.3. Products, factors and isomorphism. Let X C and Y C be two 

Z'^-subshifts. The product system X xY <Z (S x A)^'' is then a symbolic Z'^-systcm 
also, and satisfies h{X x F) = h{X) + h{Y). 

A continuous onto map Lp : X ^Y \s called a factor map if it commutes with the 
action, i.e. cr" o = o cr" for all u G Z''. An isomorphism is an invertible factor 
map. Both entropy and the property of being an SFT are invariants of isomorphism 
(although isomorphic SFTs are usually not defined by the same syntax), as is 
irreducibility. 

A factor of an SFT is called a sofic system. In general a sofic system is not an 
SFT . 

Every factor map Lp : X ^ Y arises from a so-called block code, which means 
the following: There exists a finite set F C Z'* and a function tpo : A such 

that 

{^{x)){u)^M{a^^x)\F) 

Conversely, given such a ip^ we can define </? by this formula, and then </? is a factor 
map from X onto its image. 

A factor map tt : X ^ Y oi symbolic systems X C E^*^ and Y C A^'' is called 
a one-block map if it is determined by a single symbol, i.e. it is induced by a map 
(/3o : S ^ A. We will always assume our factor maps are one-block maps. There 
is no loss of generality in this since given a factor map p : X ^ Y there is a 
system X' isomorphic to X via an isomorphism tjj : X' X so that the factor map 
tp(p : AT' ^ y is a one-block map. 

Similarly, an SFT is called one-step if it is defined by a syntax L C E^^'^^ . Every 
SFT is isomorphic to a one-step SFT. Note that for a one-step SFT, if a e E^" 
and b e are globally admissible patterns and they agree on the boundary 

of Fn (i.e. a\p^\p^_-^ ~ &|_f„\f„_i) then a U 6 is globally admissible. 
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2.4. Invariant measures and entropy. Given a symbolic system X, a Borel 
measure on X is invariant under the shift action if /i(cr"(A)) = for every 

Borel set ^ C X and every u . We denote the set of invariant Borel probability 
measures by M.{X\ The weak-* topology on M.[X) is the topology in which 
Mn — > /i if J fd^n J fdn for every continuous function / on X. This makes 
M{X) into a compact metrizable space. 

For /Lt € M{X) we denote by its measure-theoretic entropy by h{fi). We recall 
the following facts: 

(1) The entropy function h : A4{X) — > is upper semi-continuous. 

(2) The variational principle: h(X) = ixiax^^nnx) ^(m)- 

See [5] for definitions, proofs and a detailed discussion of the one-dimensional case, 
or [19] for a proof of the variational principle in the multidimensional case. 

3. COMPUTABILITY OF ENTROPIES 

In this section we show that the entropy of an SFT is right recursively enumer- 
able. This follows from the work of Friedland [9], but for completeness we give an 
short alternative proof and extend the result to sofic systems. We also prove that 
the entropy of an irreducible SFT is computable. 

Let the syntax L C define a (possibly empty) SFT X. The definition of 
entropy provides us with the sequence iV„ = Nx{Fn) such that ^logA^„ which 
converges to h{X) from above, and if Nn is computable this sequence shows that 
h is upper-semi recursive. However, iV„ is not computable in general. Indeed, 
determining whether Nn > is equivalent to deciding if the SFT defined by L is 
nonempty, and this is in general undecidable [22l [2] . 

Let us say a finite pattern a £ S^" is locally admissible if a\F+u is congruent to 
a pattern in L whenever F + u C Fn- Instead of Nn, consider the sequence 

Nn = #{locally admissible i^„-patterns} 

Clearly Nn is computable. If x E X then a;|i^„ is one of the patterns counted by 
Nn, so Nn > Nn- The inequality can be strict, because not all locally admissible 
i^„-patterns need arise in this way: there can be locally admissible finite patterns 
which don't extend to globally admissible coloring of Z''. Nonetheless, 

Theorem 3.1. For L,X and Nn as above, ^logiV„ — > h{X) from above. Conse- 
quently, h{X) is right recursively enumerable. 

Proof. Denote 

h = lim sup — log Nn 

n — >oo ri 
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Since Nn > Nn and ^ log A^„ > h{X), we have ^ logiV„ > h{X), so it suffices to 
show that h < h{X). 

Define a sequence of measures Vn on S^"* as follows. Let Wn ^ S^" be the set of 
locally admissible colorings of F„. Let /i„ denote the probability measure obtained 
by coloring each translate Fn + u for u e nZ,'^ independently and uniformly with 
patterns from Wn- Let f„ = Smsf '^"t^n- Then j/„ is an invariant probability 
measure and its entropy is easily shown to be 

Let h'n{k) b6 a subsequence such that h{vn(k)) ~* h and let be a weak-* accumula- 
tion point i'n(k) ] we may assume Vn(k) ^ ^- Since entropy is upper semi-continuous 
in the weak-* topology, we have 

h{v) > h 

On the other hand we claim that iy{X) = 1, so can be regarded as an invariant 
probability measure on X. To show this, we prove that i'([a]) = for any a e I]^\L 
, where [a] is the cylinder set defined by a. Indeed, for every k and u G Fk, if 
(F + u) C Fk then /ifc(cr-"([a])) = 0, so 

1 „r . k'^ - (k - dia.mF)'^ 
< ^#{" e Fk : {F + u)% F,} < L_ L 

where diami^ is the diameter of F with respect to the norm = maxi=i...rf \ui\. 

It is easy to see that i^fe([a]) = /ifc([a]), so 

i/([a]) = lim J^„(/c)(N) = 

Finally, the variational principle implies that h{i/) < h{X), and the theorem follows. 

□ 

With the same notation as above, let Y C A^** be a symbolic factor of X arising 
from a one-block map ipa : ^ A and its pointwise extension ip : X ^ Y. Write 

M„ = #{(/j(a) e A^" : a e Wn} 

where as before, Wn is the set of locally admissible F„-patterns for L. 

Theorem 3.2. With the above notation, ^p^logM„ — ^ from above. Conse- 

quently h{Y) is right recursively enumerable. 

Proof. Denote h{Y) = lim sup ;Jg- log A/„ . Since ip is onto we have Af„ > NyiFn), 
so ^ logMn > h{Y). Thus we only need to show h{Y) < h{Y). 

Let 9k be measures on A^"^ defined by coloring each translate Fk+u for u e nZ'^ 
with patterns drawn uniformly from {^{0) : a e Wk}. Then r]k = J2ueFk ^^^k 
is an invariant measure on A^ , and h{rjk) — -p-logA/fc. Let be measures on 
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such that the pattern on Fk + u for u E kl/ is drawn from Wk according to a 
distribution which projects under Lp to the uniform distribution on {^p{a) '■ a G W^}- 
Thus 9k = if(pik)- Let Vk ~ J2ueFk "'"/^'^i t'^^t Vk — '^{vk). Choose a subsequence 
^n(fc) so that there is a measure 77 on y with r/„(fc) — > r/ and there is a measure 1/ 
on X with ^'„(fc) z^; so ?7 = ^p{v) satisfies 77„(fc) —> ry. By upper semi-continuity, 
h{ri) > hm h(rin(k) ) ~ h, and so we will be done if we show that rj is supported on Y. 
For this it is enough to show that 1/ is supported on X, i.e. that i'{[b]) = whenever 
b E \ L. The proof of this is identical to the proof of the same statement at the 
end of theorem 13. II □ 

Corollary 3.3. The entropy of every sofic shift is right recursively enumerable. 

Proof. As noted in section[21 every sofic shift is a one-block factor of some SFT. □ 

We turn now to irreducible SFTs and the proof of theorem 11.31 Let Q„ = 
{—n,—n+ l,...,n— l,7i}'' denote the symmetric cube; note that Nx{Qn) = 
Nx{F2n+i)- For an SFT X we say that globally admissible patterns a £ 
and 6 E S'^" are r-compatible ifn> fc + r + l and a U {b\Q^\Q^_^^) is globally 
admissible. Note that if r < s and a E TP'' and b E TP" are r-compatible, then 
they are s-compatibe, provided n> k + s + 1. Clearly if X is irreducible with gap 
r then every two such patterns are r-compatible. 

Lemma 3.4. Let X C be a non-empty irreducible SFT and a E S'^'-'. Then 
precisely one of the following holds: 

(1) a 7^ 6|qj, for every large enough N and every locally admissible b E E*^™ . 

(2) For every large enough N and locally admissible b E E*^™, the patterns a,b 
are \/N -compatible. 

In the first case a is not globally admissible; in the second it is. 

Proof. Fix a E E'^''. By compactness, a is not globally admissible if and only if (1) 
holds. 

Suppose now that a is globally admissible for X and let r be a gap for X. We 
may assume that X is a one-step SFT. For every b E TP''+^+^ , if a, 6 are not r- 
compatible then by irreducibility b is not globally admissible. Hence by part (1) 
we see that for large enough the pattern 6 does not appear at the origin in any 
locally admissible c E E'^" . Since there are finitely many such 6's, we see that 
a and c are r-compatible for large enough N and for locally admissible c E E''^". 
Since ^fN > r eventually, this implies the (2). 

Finally, (2) implies that a is admissible by irreducibility and tha fact that X is 
non-empty. □ 

Corollary 3.5. For a non-empty irreducible SFT X it is decidable whether a finite 
pattern a is globally admissible. 
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Proof. To decide if a is globally admissible, find the first for which one of the 
conditions of the proposition holds (the conditions arc finitely checkable). If (2) 
holds then a is globally admissible; otherwise it is not. Note that to apply the 
proposition one does not need to know the gap. □ 

A number h is left recursively enumerable if there is an algorithm which, given 
71, produces a rational number s{n) with s{n) h and s{n) < h. If /i is both right 
and left recursively enumerable then it is computable. To see this let r(n),s(n) 
be computable sequences with s{n) < h < r{n) and r{n),s{n) h. Now given 
n, we can calculate r(fc),s(fc) for k = 1,2,3... until such a fc is reached that 
r(fc) — s{k) < i. Then r(fc) satisfies \r{k) ~ h\ < ^. This algorithm shows that h 
is computable. 

We can now prove theorem II. 31 which we repeat here for convenience: 

Theorem. The entropy of an irreducible SFT is computable. 

Proof. Let X be an irreducible SFT, and we may assume it is non-empty. We 
already know that h{X) is right recursively enumerable, so it suffices to show that 
it is left recursively enumerable, i.e. to exhibit an algorithm which given n e N 
returns a rational number s(n) such that s{n) — > h{X) and s{n) < h. 

The algorithm is as follows. First, identify all the globally admissible patterns 
ai, . . . , afc(„) G S*^" (this is computable by corollary above). With this notation we 
have k{n) — Nx{Qn) and ■|^^logfc(n) h{X). Next, find the smallest number r' 
so that each globally admissible pattern b E Y^Qn+r'+i is r'-compatible with for 
1 = 1,..., k{n). Set 

= 1 logfc(n) 

I ^n-\-r' I 

Note that r' < r, where r is a fixed gap for X. Hence 

sin)>-j^--^logkin)^h{X) 

\Wn+r\ |Vn| 

On the other hand, consider a large Qm, and consider the collection of translates 
of Qn by elements of the lattice 2(n + r')Z'' which fall inside Qm- By choice of r' 
we can color each of these translates in an arbitrary globally admissible way and 
complete it to a globally admissible Qm pattern. Since the number of translates is 
, | IQml (for convenience assume that m is a multiple of 2(n + r'), we see that 

k{m) > fc(n)^'^'''^"' 

so letting m — ^ oo, 

<n) = TT^Mn) < ^logfc(m) -> h{X) 
hence s{n) < h{X), and also s{n) h{X), as desired. □ 



Note that the algorithm given in the proof does not require prior knowledge of 
a gap for the X. It may of course be applied to any SFT, but in that case may 
not halt on some inputs, and even if it does the sequence s{n) will not necessarily 
behave as above. 

4. Outline of the main construction 

Let ft, be a right recursively enumerable number. To prove the remaining direction 
of theorem II. li we must construct for every d > 2 a d-dimensional SFT with 
entropy h. We first make some simplifying assumptions. We may restrict ourselves 
to dimension 2. since given an SFT X C the system X' C defined by 

X' = {x' e E^"^' : Vj e Z3a; e XVu e H^x'iu,]) = x{u)} 

is easily seen to be a d + 1-dimensional SFT and h{X') = h{X). Furthermore, since 
(a) the product of SFTs is an SFT, (b) h{X x y) = h{X) + h{Y) and (c) n is the 
entropy of the full shift on 2" symbols, it suffices to prove the statement under the 
assumption that h G [0, 1]. 

Our construction has three main steps: 

Step 1 : Constructing the base (section [6]): We construct an SFT X some 
of whose symbols are marked 0, 1, and such that the density of I's in each 
point of X is very uniform. It will be possible to estimate this density by 
observing any sufficiently large and well-distributed set of coordinates. 

Step 2 : Pruning (section [7]) : In this step we "kill" all points x ^ X such 
that the frequency of I's in x is strictly greater then h. In this way we 
obtain an SFT Y such that the symbol 1 appears in each y ^ Y with 
frequency at most h, and for some points the frequency is h. Furthermore, 
Y will still have zero entropy. We achieve this by superimposing another 
layer on top of X which represents calculations of a certain Turing machine, 
using as input the underlying patterns from X. This machine halts when 
it detects a density of I's greater than h. The result is that a point x £ X 
with density of I's greater than h cannot be extended to a pattern in Y] 
otherwise, it can be. 

Step 3 : Adding "Random" bits (section H): We extend Y to an SFT 
Z by allowing two new symbols, say "a" and "/?" , to appear independently 
over every occurrence of a 1 in Y . This system Z has entropy h. 

For steps 1 and 2 we utilize certain SFTs with special geometric and arithmetic 
properties. The existence of such systems, and their use in representing Turing 
machines in SFTs, appears first in Robinson's paper [22]. However, we will not 
refer directly to Robinson's construction, which would in any case require some 
modification to suit our needs. Instead we rely on a theorem of Mozes |20] about 
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the realization of substitution systems by SFTs. This theorem, which allows us 
to easily construct variants of Robinson's system, is presented in the next section 
together with another technical definition. Following that we give the details of 
steps 1, 2 and 3. 

Before moving on, we note that our arguments give the following result, which 
may be of independent interest: 

Theorem 4.1. A real number r > is right recursively enumerable if and only if 
there is an alphabet S, a symbol a G S and an SFT X CT,^ such that 

sup lim 7^#{it e Fn ■■ x{u) = a} = r 

(and in particular the limit above exists for every x (z X). Furthermore if r is 
computable then one can find E, a, X so that lim„^oo j^^{u G Fn '■ x{u) ~ a} ~ 
r for every x X . 

5. Substitutions and superpositions 
In this section we describe two technical devices for constructing SFTs. 

5.1. Subshifts defined by Substitution. Given a finite alphabet E, a substitu- 
tion rule is a map s : E — > S^*-' for some integer fc > 1, where Fk = {1, . . . ,k} x 
{1, . . . , fc} (in the terminology of [20], this is a deterministic k x k substitution sys- 
tem with property A). The map s extends naturally to a map s„ : E^" — > E^" *= 
by identifying Y.^" '' with (E'^'=)^". 

Starting from a single symbol located at (1,1) G and iterating the substitution 
map, we obtain a sequence of colorings of Fk^ for n = 0, 1, 2 . . .. Such patterns are 
called s-blocks. A point x G E^ is admissible for s if every finite subpattern of 
X appears in some s-block. The subshift W C E^^ associated with s is the set of 
admissible patterns; this is seen to be closed and shift invariant. 

Define Sao : W W hy applying s to each symbol of x; more precisely, 
Soo{x){u) ~ s{x{u'))(u"), where u' G I? and u" G Fk are the unique vectors 
such that u — ku' + u" . Clearly Soo maps W into W. We say that x is derived 
from y if T"x = Soo{y) for some v £ Fk- It is not hard to show that each x G V7 is 
derived from some y G W; if this y is unique, we say that s has unique derivation. 

Theorem 5.1. (Theorem 4-5 of |20j ) Let s : —> E^*" be a substitution rule with 
unique derivation and let W be the associated dynamical system. Then there exists 
an alphabet A, an SFT W C , and a one-block factor map (p : W ^ W . 
Furthermore ip is an injection on a set having full measure with respect to every 
invariant measure on W . 

Note that theorem 15.11 is false in dimension d= 1. 
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Proposition 5.2. If s,W and W are as in the theorem then h{W) = 0. 

Proof. For any fi invariant on W, the map ip is an isomorpliism of dynamical systems 
between {W,fj.) and {W,(piJ,) where (fji is the push-forward of fj. to W. Hence it 
suffices to show that the latter system has zero measure-theoretic entropy. By the 
variational principle it suffices to show that h(W) ~ 0. Fix m. Since every large 
enough s-blocks is composed of an array of smaller s-blocks of dimension k™ x fc™ 
arranged in a square, it follows that for n > k"^ an admissible i<"„-pattern can be 
decompose F„ into ([-jSr] — 2)^ disjoint s-blocks of dimension fc™ x fc™ together with 
a "small" remaining region near the boundary. Thus the number of Fn patterns is 
at most 

Nf„{W) < #{k X k s-blocks}(["/'='"l-2)' . lEl^"'^'" 

where the second term on the right hand side is the number of ways to fill in the 
region near the boundary of Fn not covered by the s-blocks. Since there are only 
different s-blocks of dimension fc™ x k™ (because each is derived from one of 
the original symbols), for all large enough n we have 
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k" 



as m was arbitrary, h{W) ~ 0. □ 

We use theorem lS.l) which is due to Mozes, to construct systems similar in many 
respects to Robinson's system from j22j. We remark that although this allows a 
more economical exposition the gain is cosmetic. Indeed, the proof of theorem 15. II 
relies on an elaborate extension of Robinson's techniques. There has recently been 
a revival of interest in substitutions and their realization using local rules; see e.g. 



5.2. Superposition. Given an SFT X defined by a syntax L, superposition is 
a syntactic process which gives an SFT X' which factors onto a subshift of X. 
Informally, this is done by adding data to each symbol of X and enriching the 
syntax with rules relating to this new data. 

More precisely, suppose X is an SFT defined by a syntax L C E^. A system Y 
is superimposed over X if it is obtained by the following process, (a) For a finite 
set A, we replace each symbol of cr e S with one or more symbols if the form 
(cr, (5) G S X A. Let E' be the set of these pairs. For the new symbol (cr, 5) G E', 
we say that 6 is superimposed over cr; we also frequently refer to this pair as the 
symbol a marked with 6. (b) We extend each pattern a G L E^ one or more 
patterns a' £ (S')^ by superimposing new symbols over each symbol of a. Call 
the new syntax L' . The SFT X' defined by L' has the property that every pattern 
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Figure 6.1. Three iterations of the substitution. 



appearing in X' consists of a A-pattern superimposed over a E-pattern, and the 
E-pattern is admissible for X. 

Note that the map tt : X' E^ which erases the superimposed layer of data 
maps X' into a subsystem of X . We say that x ^ X is represented in X' if one 
can turn x into a point of X' by superimposing a suitable A-pattern over x; i.e., if 
X = tt{x') for some x' G X'. 

6. Step 1: Constructing the Base 

In this section we construct a two-dimensional SFT X whose symbols are marked 
with the symbols 0, 1. The symbol 1 may appear with any density in points of x, 
but for each fixed x Cz X the density of I's will be extremely uniform. 

6.1. An almost periodic SFT. Consider the substitution on the alphabet {o,»} 
defined the the rule 




Let W denote the dynamical system defined by these rules. See figure [^Tl 
We say that a set i? C is a 2-net E = W^^iln x J„ where each /„ and 
Jn are translates of 2"Z, the /„'s are pairwise disjoint, and the J„'s are pairwise 
disjoint. We refer to /„ x J„ as the n-th level associated with E. Note that if u 
belongs to some level of E then the row and column to which u belongs do not 
intersect any other level. 

Proposition 6.1. Let w G W and 

E{w) = {uel? : w{u) = •} 
Then E = E{w) is a 2-net. 

Proof. For n = 0, 1, 2, . . . let a„ be the sequence of i^2" square patterns obtained by 
applying the substitution rule to the initial symbol oq = •. It is sufficient to show 
that there is a 2-net E = U^^^In x J„ such that {u S : a„(u) = •} = ETl 
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To verify this, one proves by induction that the above holds for 

I„ = J^ = 2"Z + 2"-i □ 

We remark that the system W supports a unique invariant probabihty measure 
and as a measure preserving system this is an odometer, i.e. is isomorphic to a 
zero-dimensional abelian group along with a free minimal action generated by 
translation by two elements of the group. 

This substitution rule has unique derivation, since one may check that there is 
unique way to derive the central 6x6 square of the 8x8 pattern in figure [6Tl from 
a 4 X 4 pattern. 

Let W be the SFT associated to W by theorem 15. II Then to each point in W 
there is associated, via a one-block map, a {o,»} pattern defining a 2-net. 



6.2. Marking the columns of W. We now superimpose another layer on top of 
W. Begin by superimposing the symbols 0, 1 on top of the W with the constraint 
that the symbols 0, 1 cannot be placed vertically adjacent to each other. This forces 
each column in the resulting system to be marked either entirely with O's or entirely 
with I's. 

For a point w G W, the new coloring induces a {0, l}-coloring of each level I x J 
in the decomposition given by the proposition. This coloring is constant on the 
intersection of /x J with columns; we now force it to be constant on the intersection 
of the grid with rows. For this, superimpose two new symbols "< — >" ," <;=^" on top 
of the existing ones. We think of < — > as transmitting a "0" signal, and of as 
transmitting a "1" signal. The rules are that over a symbol marked •, the symbol 
< — > appears always together with the symbol 0, and <=> appears always together 
with the symbol 1. We also require that < — > and <;=4> cannot appear as horizontal 
neighbors, so the arrow type is constant on rows. 

Call the resulting system X (it is of course an SFT) and let x E X be superim- 
posed over a point w € Wq. Let / x J be some level of the 2-net induced by w, and 
suppose that w{u) is marked for some u € I x J. .Since it is also marked •, it 
bears the symbol < — > (and not this forces the entire row to which u belongs 

to be marked with < — >. Every other v E I x J belonging to the same row is thus 
marked • and < — >, and so it must be marked 0. A similar analysis holds if w(u) is 
marked 1. 

In short, the 0, 1-coloring of each grid / x J is constant on rows and columns, and 
thus is completely constant. If /„ x J„ are the levels of the 2-net induced by a point 
X E X then each /„ x J„ determines a collection of columns which is 2"-periodic in 
the horizontal direction, and all these columns bear the same symbol or 1. 
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For X G X, let S{x) be the upper density of I's in i.e. 

6{x) = hmsup — 

where as usual Fn = {1, . . . , n}^. If In x are the levels of the 2-net induced by 
then a simple calculation shows that 

oo 

where p„ is or 1 according to the coloring x induces on /„ x J„ . Since the /„'s and 
J„'s are pairwise disjoint the arrows transmitting information between the points 
of each grid occupy different rows, and hence don't interact. Therefore, we are 
free to color each level or 1 independently of the coloring of the other levels. 
Consequently, any sequence /?„ e {0, 1} may arise, so there are points x € X with 
S{x) taking on any value in the range [0, 1]. 

We will call a point in X exceptional if it is superimposed over an exceptional 
point of W. For an exceptional point x ^ X there are complementary half-spaces 
and/or quarter-spaces such that the restriction of x to each of them looks like a 
non-exceptional point. Thus the above analysis applies to each of these regions 
separately. This is not to say that we can glue admissible half- and quarter-spaces 
together arbitrarily, and indeed for exceptional points the arrows from different 
parts can interact; but this will not matter to us. 

Finally, we claim that X has zero entropy. Indeed, W has zero entropy, and it is 
simple to check that if a is a square pattern admissible for W then every extension 
of a to a pattern b admissible for X is determined by the symbols of b on the 
boundary of the square. It follows that X has entropy 0. 



7. Step 2: Pruning 

Let /i be a fixed right recursively enumerable number. Let X be the system 
constructed in the previous section. Our goal in this section is to construct an SFT 
Y superimposed over X which "kills" points with density of I's greater than h. 
More precisely, we will want 

sup{5(y) : y eY}^ h 

(here 6 is the natural extension of 6 from X to Y) and that the supremum will be 
achieved. 

7.1. Boards. We define a substitution system over the alphabet 

J, T,±,h,H, +,■,□} 
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The substitution rules are described in figure 17.11 together the symmetric rules 
obtained by rotating by multiples of 90°. Let us denote by 6„ the 5" x 5"-pattern 
obtained by applying the substitution rule n times to the symbol I; see figure [73] 
It is not hard to show that ■ appears with period 5 in every &„. Given k and n > k, 
since I appear in bn^k with period 5 we see that bi appears in bn-k+i with period 
5*, so bk appears in with period S'^. 

As can be seen from figure 17.11 this substitution rule produces patterns which 
induce certain grid-like shapes on . More precisely, define finite sets C N 
inductively by Ii ~ {1,2, 4, 5} and 

In+i = In U (/„ + 5") U (/„ + 3 • 5") U (/„ + 4 • 5") 

One sees by induction that min/„ = 1 and max/„ = 5", so the union above is 
disjoint, and hence |/„| = 4" for each n. Let 

B„ = (/„ X {1,2,...,5"})U({1,2,...,5"} x/„) 

This is the set obtained by "filling in" the rows and columns between points of 
In^ In- The set i3„ is called an n- board. An infinite board is any set B Ql? which 
is the limit of a sequence of translates _B„ + Un for some u„ G Z^, where by limit 
we mean that u e _B if and only if eventually u G -B„ + It is simple to check 
that every infinite board B has density zero, i.e. for every e > there is an N so 
^^^IFjvl^"'*^ ^ ^'^'^ every u ^1? . This follows from the recursion formula for /„ . 

Consider the patterns 6n|B„. One shows by induction that these patterns do 
not contain the symbols ■, □. Also, for u £ Bn the points v G Bn which are 
adjacent to u - i.e., which differ from u by ±ei or ±62 - are determined by bn{u) 
by interpreting the symbol bn(u) as a collection of lines pointing to the neighbors 
of u in Bn- Thus, ± indicates that there are neighbors left, right and above the 
current symbol; — indicates neighbors to the left and right of it; etc. One can show 
that if M € i^s.. = {1, . . . , 5"} X {1, ... , 5"} and 6„(u) ^ {■, □}, then there is a 
unique k and translate A of Bk so that u € A and 6„|^ is congruent to bklsk- Iii 
the large square in figure FTT] there are two boards visible; a 1-board in the center, 
and a 2-board surrounding it. If we iterate the substitution one more step, each 
■ will turn into a 1-board plus H's, the 1-board will turn into a 2-board plus H's, 
and the 2-board will turn into a 3-board, plus B's. 

Let R denote the dynamical system defined by these rules. From the remarks 
above it follows that each r G R determines a pairwise disjoint collection of boards, 
with n-boards appearing periodically with period 5"; and if u G r and r(u) ^ 
{■, □}, then u belongs to one of these boards and the neighbors of u in this board 
can be determined from symbols r(u). By compactness, there will exists points 
r G R and infinite boards B so that rjs is marked similarly to a finite board. Since 



16 



h + — hH 
I I ■ I I 

h + - + H 



rn rn 

h + — hH 
I I ■ I I 

h + - + H 



L. 



L. 



r ~r " 



■ + + + + h + - 



■ + H — + + + + ■ 



"~r n 

■ + H 



hH — + + + + — h + - 
hH — + + + + — h + - 
hH — + + + + — h + - 

hH — + + + + — h + - 



■ + H — + + + H hH 

■ + H — + + + H hH 

■ + H — + + + H hH 

- + H — + + + H hH 



hH — + ■ 



■H — + + + + ■ 

















r ~i r n 






























h + — hH 


















■ 








■ 




■ 




■ 








■ 


















h + - + H 
































■ 








■ 






■ 




■ 








■ 



hH — + + + + — h + - 
hH — + + + + — h + - 
hH — + + + + — h + - 



■ + + + + ■ 

■ + + + + ■ 

■ + + + + ■ 

■ + + + + ■ 



■ + H 

■ + H 

■ + H 



hH — + + + + — h + - 



■ + H — + + + + ■ 



L. 



■ + H 



Figure 7.1. The substitution rules, up to rotation. The symbol 
□ is represented as an empty square. The large 25x25 pattern is 
obtained by applying the substitution rules twice to I 
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infinite boards cannot overlap and each occupies at least some quarter-space, there 
can be at most four infinite boards in r, and since each has density 0, the density 
of points belonging to infinite boards in r is zero. 

R has unique derivation; indeed, the location of the corner tiles determine the 
derivation of a point. We denote by R the SFT associated to R by Mozes' theorem. 
We identify points in R with the point in R they are mapped to by the given 
one-block map; in general this identification is many-to-one. 

7.2. Turing machines and their representations in SFTs. A Turing machine 
is an automaton with a finite number of internal states which reads and writes 
data on a one-sided infinite array of cells indexed by N, called the tape. Each cell 
contains one symbol from the data alphabet (so in our model the input is an infinite 
sequence). The computation begins with the machine located at the 0-th (leftmost) 
cell and in a special initial state, and the tape contains some data which is the input 
to the computation. The state of the data tape along with the location and internal 
state of the machine are called a configuration; a configuration imiquely determines 
all future configurations. The computation proceeds in discrete time steps. At each 
iteration the machine is located at some cell, reads the symbol written there and 
based on this data and on its internal state, performs three actions: (a) it replaces 
the current data symbol with a new one, (b) it moves one cell to the left or to the 
right, and (c) it updates its internal state. The computation may halt after a finite 
number of steps if the machine either moves off the tape (steps left at cell 0) or 
enters a designated state, called the halting state. Barring these occurrences, the 
computation continues forever. 

Although a very simple model, any algorithm written in a modern computer 
programming language can be implemented as a Turing machine, and it is generally 
accepted that any effective computation can be performed by a Turing machine; 
this is Church's thesis. For background and basic facts on this subject, see [12]. 

Let X be the SFT constructed in section [6l let R be the SFT described above 
and let T be a Turing machine whose data alphabet includes symbols 0, 1. We 
construct an SFT Yt superimposed over X x R such that when a point y & Y 
is superimposed over (x, r) G X x R, each board induced by r has superimposed 
over it a pattern representing the run of T on the input given by the sequence of 
0, 1's appearing in x along the columns of the board. This construction, which we 
describe next, is similar to the one used by Robinson in [52, except that Robinson's 
machines always ran on an "empty" input. 

Let ^, p be symbols in the alphabets of X, R respectively. We superimpose new 
symbols over {£,,p) only if p represents a point in a board (i.e. p ^ D,U), and 
the adjacency rules for the new symbols will only restrict pairs of neighbors which 

18 



belong to the same board (note that this can be determmed locaUy). Thus {x,r) £ 
X X R will be represented in Yt if and only if for each (finite or infinite) board B 
induced by r there exists a locally admissible pattern superimposed over {x,r)\B- 

For a board Bn + u let us call the points In x In + u the nodes of the board. 
Note that p) represents a node if and only if p e {'~, ~', l, j, T, _L, h, H, +}. The 
data superimposed over a node will include a combination of data symbol (from 
the machine's data alphabet) and possibly also a machine state; this information 
may be represented by the alphabet Ai U (Ai x A2) where the union is disjoint, 
Ai is the machine's data alphabet and A2 its state space. 

Each row of nodes in a board is to represent a finite portion of the configuration 
of the machine. More precisely, each node will contain either a data symbol or a 
data symbol and a machine state; this is called the cell's configuration. Suppose 
X £ X and r G R induces a board B. We can arrange things so that 

(1) The data symbols in the nodes of the bottom row are the symbol or 1 
induced by x on that node. 

(2) The node at the lower left corner of B contains the initial state of the 
machine, and no other node in the bottom row contains a machine state. 

(3) Each row of nodes except the bottom one represents the configuration ob- 
tained by iterating the computation one step from the configuration given 
in the row below it. In particular, no row can appear admissibly above a 
row containing a halting state. 

Properties (1) and (2) are easily implemented by restricting the types of symbols 
which may be superimposed over (^, p) when p G {l, — , _L, j}. 

Implementing (3) with local rules requires a little more effort. First, note that in 
the course of the operation of a Turing machine T, the configuration of a cell i at a 
time t > 1 is a function of the configurations of the cells i — + 1 at time t — 1; 
indeed the data on the cell is determined by the configuration at i, and the presence 
and state of the machine depend on the configurations of the cells at i — 1 , i + 1 
(in case i = 0, the dependence is on the cells at j, i + 1 only). We write T{u, v, w) 
for the state of i at time t + I given that at time t cells i — 1, i, i + 1 were in states 
u, V, w respectively (we allow u ~ "null" in case i ^ 0). If we forget the geometry 
of the boards and imagine configurations of the machine represented as sequences 
of cell configurations stacked one on top of the other, this transition is "local" and 

v' 

can be enforced by a local rule that every pattern of the form must 

u V w 

satisfy v' = T(u, w, w). 

However, when we represent cell configurations in nodes of a board the transition 
from row to row is no longer local, since in a board the nodes representing successive 
cells are spread out in space and may be arbitrarily far apart. We can overcome 
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this by using the rows and columns between nodes (which belong to the board, and 
therefore do not overlap for distinct boards) to "transmit information" . In this way 
we can guarantee that the symbol superimposed over the immediate neighbors of 
each node indicate the cell configuration at each of the neighboring nodes. This 
can be implemented in a manner similar to the way in which we synchronized 
the coloring of 2-nets in X in section [6] Briefly, over each grid point marked — 
we superimpose a pair of symbols (u, v) where u, v are node configurations. We 
require that each pair of horizontally adjacent — 's are marked with the same pair, 
so all members of an uninterrupted horizontal sequence of — 's carry the same pair. 
When a pair H — appear and + has configuration u we require that over — there 
is a pair {u,v) for some v] and similarly for pairs _L — and T^. The symmetric 
condition is imposed for — h, — -L and — T. The result is that every uninterrupted 
horizontal sequence of — 's carries the pair (w, v) where u is the configuration of the 
node at which the sequence ends on the left, and v the configuration of the node 
ending the sequence on the right. 

Next, over each symbol | we superimpose a pair {u,v,w), where u,v,w are cell 
configurations and u or w may also be "blank". As for — 's, we require that the 
marking is constant for each uninterrupted vertical sequence of |'s. The markings 
are determined as follows. If a | is located immediately above a node with configu- 
ration V, and the nodes to the left and right of that node have configurations w, w 
respectively, then | carries (w, u or w are be "blank" in the case there is no 

node to the left or right of the node below | (i.e. if it is at the edge of the board). 
Note that by the previous discussion, u,v,w may be determined by looking at the 
immediate neighbors of the |. Thus the column of |'s above each node represented 
the configuration of that node and its neighbors. 

Finally, we require that when a node in state v' appears vertically above a 
I marked {u,v,w), then v' = T{u,v,w). These conditions can be seen to force 
property (3). 

We summarize this construction and its properties in the following proposition: 

Proposition 7.1. Given the systems X,R from sections\^ and \7.1\ respectively, 
and given a Turing machine T , there exists an SFT Yt superimposed over X x R 
such that the following are equivalent: 

(1) (x,r) Cz X X R is represented in Yt- 

(2) For each finite or infinite board B induced by r and containing the symbol 
L, when T is run on the sequence of Q, 1-s induced by x on the columns of 
B the number of steps it runs without halting is at least equal to the number 
of rows in B . 

Furthermore, h{YT) = 0. 
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Proof. The equivalence follows from the discussion preceding the theorem. We only 
note that if a board B induced by r does not contain the symbol l then it can always 
be extended, e.g. by a pattern in which all rows are the same and contain only 
data. Note that in general, there may be infinitely many ways to superimpose a 
pattern over an infinite board which does not contain l. Thus the projection from 
Yt into X X R is not an injection. 

It remains to check that h{YT) = 0. Given an N x N pattern a appearing in 
X X R, if Bn + u is a board induced by R and contained in Fj^ then there is a 
unique way to extend a to a locally admissible Yt pattern. This is true also for 
symbols in a which do not lie in any board. Given e > 0, a simple estimate shows 
that if N is large enough these points make up all but an e-fraction of the points in 
Fn, the remaining points coming from "boards" which intersect the boundary of 
Fn or infinite boards, all of which have density tending to zero as iV ^ oo. Hence 
a can be completed in at most 2^(^''^^ ways with e{N) 0. It now follows that 

therefore 

HYt) < \ lim N^^j,{F^) + lim e{n) < h{X) + h{R) = 

n— >oo ^'^-n. n^oo 

as claimed. □ 

7.3. Pruning. Our aim now is to find a Turing machine T so that (x, r) S X x i? 
is represented in Yt if and only if 5{x) < h. 

Recall that this machine T will receive as its input sequences of 0, 1's induced by 
points X G X on translates of /„. Write / = Uj^]^/„, and enumerate the elements 
of / as / = {ii < 12 < ■ • ■}, where ii = 1. Note that the first 4" elements of 
this sequence are precisely the elements of /„ ; this follows easily from the recursion 
relation defining the /„'s. If {x,r) G X x R and i?„ + u is an n-board induced by 
r, then the 0, 1-coloring induced by a; on /„ x {1} + u is the sequence {xj)^^^ such 
that Xj is the symbol or 1 appearing on the ij-th cohmm in T^x. It follows that 
for any k < n, the first 4*"' symbols of this sequence correspond to a pattern induced 
by X on some translate of Ik- 

Lemma 7.2. There is a sequence of finite sets A/„ C /^^.jn^ such that {i„i : m g 
M„} is a complete set of residue classes modulo 2", i.e. for every < j < 2" there 
exists a unique m G Mn such that im = j mod 2" . 

Proof. By the recursion formula for 1^ given section 17.11 and the fact that Ik is an 
increasing sequence, for any k < q we have 

4 + 5« c /, + 59 C 
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In particular, since 1 G /i, we may show by induction that for any r and 

1 + 59 + + . . . + 5*9 C /,+2,...+tg+l 

Given n, since gcd(2", 5) = 1 we may choose q < 2" so that 5' = 1 mod 2". Since 
the set 

{l + S' + .-. + S'^Ci 
is a complete set of residues modulo 2" and is contained in l23n ; the existence of 

Mn follows. □ 

It is clearly possible to compute a sequence of sets M„ C 12^^ with the above 
properties. The proof above gives an algorithm for going so, since the identity 
5'' = 1 mod 2" is solved by q = 0(2") (here is Euler's function). 

Let r{n) be a computable sequence and h < r{n) — > h. We can now describe our 
algorithm: 

Algorithm 7.3. Input: (.T„)„gz e {0,1}^. 
For iV 1,2,3... do 

(1) Calculate r{N). 

(2) Calculate the relative frequency Sn of 1 's in the sequence {xm ■ fn G Mn), 
i.e. 



1 

2 

(3) If6N > r{N) + 2-^ then halt. 



= T;pf#{lTl e Mn : Xrn = 1} 



Proposition 7.4. Let x X and let {xn)'^=i be the 0, 1-valued sequence with Xn 
equal to the color of the i„-th column of x. Then the algorithm \ 7.3\ halts on the 
input (xn) if and only if S{x) > h, and if it halts the number of steps it runs before 
halting depends only on S{x) (not on x). 

Proof It sufHces to show that S{x)~2-^ < Sn < S{x) + 2-^ for every N. Indeed, if 
S{x) ~ h+e for some e > then Sn > S{x) — 2~^ implies that Sn > h — e/2 for large 
enough N, and since r{N) h for large enough N we will have Sn > r{N) + 2~^ 
and the algorithm will halt. On the other hand if S{x) < h then Sn < S{x) + 2~^ 
implies that Sn <h + 2"^ < r{N) + 2"^, so the algorithm wiU run forever. 
Fix N >l and let E = U^iC/„ x I4 be the 2-net induced by x. Note that 

oc 

J(x) = ^p„2-" 

where pn G {0, 1} is the symbol induced by x on the grid [/„ x Vn- 

Note that |M„| = 2". Let J„ = {j e Mn : ij G [/„}. Since {i^ : j e Mn} is a 
complete set of residues modulo 2^, for each n < N we have 

I J„| = 2^"" 
22 



and since the J7„'s arc pairwise disjoint so are the J„'s, so 

N 

\Mn \ U -^"I = 1 



ri=l 



Let Mpf \ yJn=iJn = {i} and let p' E {0,1} be the symbol induced on the i-th 
column of x. Then 



S 



N 



1 ^ 



2JV 

jeM_ 



N 



n=ljGJ„ 

P^.\^ Pn\Jin)\ 

ri=l 
N 

n=l 



The desired inequality follows. 

Regarding the number of steps the algorithm runs before halting, this depends 
only on N and 5{x). □ 

Let T be a Turing machine implementing this algorithm and whose input is 
the sequence of O's and I's which is the input to the algorithm. We make one 
important assumption about the implementation, namely that there are integers 
tj4 such that the machine performs the first N iterations of the loop in at most 
In steps (or halts before that), independent of the input. Such an implementation 
does not present any difficulty. Another thing to note is that as we have defined 
it, the entire tape is taken up by input data. In order to provide the machine with 
space to store its intermediate calculations one can allow it to superimpose another 
layer of symbols over the input alphabet. Formally, this can be done by setting the 
machines alphabet to be {0, 1} x {0', 1'}, with the input represented by the first 
coordinate and the machines modifying the second coordinate as it pleases. 

Let Y — Yt] this is the system Y whose construction was the goal of the second 
step in the outline given in section 01 

Proposition 7.5. // (x, r) e X x i? then {x, r) is represented in Yt if and only if 
S{x) < h. 

Proof. By 17.11 it suffices to show that the condition S{x) < h is equivalent to the 
fact that for any finite or infinite board B induced by r representing an N x N 
grid {N e N U {00}), if rjs contains the symbol l then the algorithm does not 
halt after N steps when run on the input induced by x on the columns of B. The 
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proposition now follows easily from proposition 17.41 and the fact that r induces 
boards of arbitrarily large size. □ 



Finally, we note that the topological entropy oiY = YT is zero by proposition 

o 



8. Step 3: Adding and calculating entropy 

Let Y be the system constructed in the previous section. Let Z be the SFT 
superimposed over Y by adding one of the symbols a, (3 over each occurrence of the 
symbol 1. We place no other restrictions on the configurations of a, /9's which may 
appear. In this section we estimate the entropy of Z and show that it is indeed 
equal to h ~ sup{(5(j/) : y e Y}. 

Write F„ = {1, . . . , n}^ again and for y G Y denote 

fniy) = ^#{" e : yiu) is marked "1"} 

\^n\ 

SO S{y) = limsup„^Q^ fniy)- Since S{y) < h for every y G Y there is a sequence 
En such that 

sup fniy) < h + s„ 

(such a sequence e„ exists by general considerations, but in our case by the proof 
of proposition 17.41 one can choose e„ = 2~"+^). 

Wc now estimate the number of patterns induced by Z on the box Fn = 
{1, . . . , n}^. For each pattern induced on Fnhy y £ Y , the number of ways to super- 
impose the symbols a, b and get an admissible pattern for Z is 2-^"'^^'l^"l = 2^" 
Summing over all patterns induced on Fn by Y and using the fact that /„ (y) < h+Sn 
we have 

NziFn) < NriFn) ■ 2-^"('')"' < 7Vy(F„) • 2"'(''+^") 

so 

limsup log NziFn) < limsup log A^y(i^„) + limsup(/i + £„) = h 

n — ^oo \-^n\ n — *oo \-^n\ n — *oo 

because hiY) = hmsup log A^y(F„) = 0. 

On the other hand, if ?/„ G Y satisfy /n(t/n) ^ h then clearly the number of 
ways to extend yn\F„ to a pattern in Z is 2-''"'^^")l^"l and so 

liminf -^logA^2(F„) > limsup/„(y„) = h 

The entropy estimate hiZ) = h follows. 
This completes the proof of thcorcm ll.il 
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9. Concluding remarks 



Many questions remain about the relation between the dynamics SFTs and their 
entropies. Let us take a closer look at the system Z constructed above. We can 
write Z as a disjoint union Z = Uo<r<hZr where Zr is the (nonempty) set of points 
z Cz Z with S{z) = r; each Zr is a closed shift-invariant set, so every orbit closure in 
Z lies in some Zr- Hence Z is not transitive. Z also does not have periodic points, 
since it factors onto the infinite uniquely ergodic system W. 

We remark that if h is computable instead of merely right recursively enumerable, 
then one can modify algorithm 17.31 so as to also kill points whose density of I's is 
less than h (computability implies both right and left recursive cnumcrability). For 
this algorithm the resulting system is essentially the system Z^ above. However, it 
is still not transitive, since there are many ways to extend an infinite board which 
does not contain a bottom row; this does not affect entropy, since infinite boards 
have density zero, but means that has a transient part. 

Problem 9.1. Is every right recursively enumerable number h the entropy of a 
transitive SET? 

Conversely, we have seen that the entropy of irreducible SFTs is computable. 
This raises the following: 

Problem 9.2. What is the class of entropies of multidimensional irreducible SFTs? 

Another mechanism which may be related to entropy is the presence of periodic 
points. For a two-dimensional SFT X denote by Pn the number of n x n patterns 
which can be repeated to produce an admissible tiling of the lattice with period 
k in both directions. Clearly P„ is computable, and in certain situations one can 
show that lim log P„ h and log P„ < ft. + for a sequence £„ which decays 
to at a computable rate. This implies that the entropy is computable, because 
for Nn as in section |3] we have h £ logP,i — En, log A^„), and so given n we can 
examine the difference ^ \ogNn — logP„ — e„) for n — 1, 2, 3 . . ., stop the first 
time it is less than i, and give ^ logP„ — e„ as our estimate. 

Friedland [S] used this observation to deduce that if the syntax of an SFT enjoys 
a certain spacial symmetry then the entropy is computable. We note also that 
irreducible SFTs in two dimensions have dense periodic points, but whether this is 
so in higher dimensions seems to be open |26| . 

Problem 9.3. Do dense periodic points for an SFT imply that the entropy is 
computable? 

Finally, we repeat here an old question which we mentioned in the introduction: 
Problem 9.4. Is every sofic shift a factor of an SFT with the same entropy? 
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